National Repository of Grey Literature: 18 records found (1 - 10)
Automated Web Page Categorization Tool
Lat, Radek ; Bartík, Vladimír (referee) ; Malčík, Dominik (advisor)
This master's thesis describes the design and implementation of a tool for automatic categorization of web pages. The tool should be able to learn from sample web pages what each category looks like, and then assign the learned categories to previously unseen web pages. The tool should support multiple categories and languages. Advanced techniques from machine learning, language detection and data mining were used in its development. The tool is based on open source libraries and is written in Python 3.3.
Web application for searching for documents related to given product
Ledniczky, Péter ; Povoda, Lukáš (referee) ; Burget, Radim (advisor)
The aim of this project is to create a web application that automatically collects available text content from the Internet. It then looks for predefined keywords and, according to their occurrence, analyzes whimsical text index. The evaluation results are presented through graphs. The work is done using HTML, CSS, JavaScript, PHP and SQL.
Web as a Source for Automatic Creation of Morphological Dictionary
Bulka, Pavol ; Matějka, Pavel (referee) ; Smrž, Pavel (advisor)
Creation of natural language words is based on rules, which are generally complex. It is often very difficult or even impossible to describe them precisely in a formal way. That is why we use a morphological dictionary to process natural language. In this paper we discuss the creation of a morphological dictionary from the web of Slovakia's top-level domain. We also cover web crawling, data processing for morphological analysis, and data structures. This document clarifies the basic principles and concept of morphological analysis. The final system described in this thesis produces a morphological dictionary. This dictionary can be used in various applications, for example spell checking, machine translation and so on.
Tool for Automatic Information Obtaning from the Web
Poliak, Jakub ; Harár, Pavol (referee) ; Povoda, Lukáš (advisor)
This bachelor thesis deals with programming a tool for collecting positive and negative comments from one of the most popular Chinese e-shops into a database. The data will be used for deep learning of an artificial neural network that should distinguish positive text from negative. The application was programmed in Java using the JSON-simple and jsoup libraries.
Crawler chassis for a forwarder
Plichta, Zbyněk ; Kašpárek, Jaroslav (referee) ; Škopán, Miroslav (advisor)
The first part of the bachelor's thesis introduces crawler chassis used in forwarders and harvesters; the second part presents a concept design of a crawler chassis for a forwarder based on the given parameters. The design concentrates on the arrangement of the undercarriage and the choice of suitable track and idlers. Another aim of the design is to find an appropriate solution for the suspension of the bottom rollers with sufficient travel while keeping the dimensions of the undercarriage small. The outcome of the thesis is a drawing of the designed crawler chassis.
A Service for Verification of Czech Attorneys
Jílek, Radim ; Glembek, Ondřej (referee) ; Szőke, Igor (advisor)
This thesis deals with the design and implementation of an Internet service that makes it possible to objectively assess and verify the reliability and diligence of Czech lawyers based on publicly available data from several courts. The aim of the thesis is to create this service and put it into operation. The result of the work is a set of programs that carry out the partial tasks involved in realizing this goal.
Automated Search and Storage of Product Reviews (Automatizované vyhledávání a uchovávání recenzí o produktech)
Voráč, Tomáš
The diploma thesis deals with the automated search for reviews on web pages and the storage of the reviews found. It describes in detail the options for storing unstructured data and the subsequent selection of the most suitable storage. The main part of the work analyzes HTML structure so that the required information can be found on a website. The work also covers ways to determine the similarity of text strings in order to decide which product a found review belongs to. The Python programming language was used for the implementation.
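As an illustration of the string-similarity step mentioned in this abstract, the following is a minimal sketch, not the method used in the thesis. It assumes the standard-library `difflib.SequenceMatcher` as the similarity measure; the function names `similarity` and `best_product` and the 0.6 threshold are hypothetical choices made for the example.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a ratio in [0, 1]; 1.0 means the normalized strings are identical."""
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def best_product(review_title: str, product_names: list[str], threshold: float = 0.6):
    """Return the most similar product name, or None if none reaches the threshold.

    The threshold is an assumed value; a real system would tune it on data.
    """
    if not product_names:
        return None
    scored = [(similarity(review_title, name), name) for name in product_names]
    score, name = max(scored)
    return name if score >= threshold else None
```

A call such as `best_product("Acme X200 phone review", ["Acme X200 Phone", "Bolt Kettle 2L"])` would pick the first name, since the review title contains it almost verbatim.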
Analysis of Real-World XML Queries
Hlísta, Peter ; Holubová, Irena (advisor) ; Svoboda, Martin (referee)
The aim of this thesis was to gather and analyze real-world XQuery programs. Data gathering is usually performed using a crawler. Part of the thesis was to analyze different crawlers and choose the most suitable one. The crawler was then modified so that it would not overload servers, would gather the right data, and could be paused. Before the main gathering, two problems had to be solved: where to start the gathering and how long it would take. After the data were gathered, they were cleaned, corrected and validated. The subject of the analysis was the usage of the XQuery language and its grammar symbols. We also analyzed the XML documents used by the XQuery programs and the outputs of the XQuery programs. The main contributions of this thesis are the amount of gathered data (in comparison with other sources), the gathering of the XML documents being queried, the use of Analyzer for analyzing the real-world XQuery programs, and the running of these real-world XQuery programs over the gathered XML documents.
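The requirement that a crawler not overload servers is commonly met with a minimum delay between requests to the same host. The sketch below shows that idea only; it is not the crawler modified in the thesis. The class name `PoliteScheduler`, the `min_delay` parameter and the injectable `clock` and `sleep` callables are assumptions introduced for the example.

```python
import time
from urllib.parse import urlparse


class PoliteScheduler:
    """Enforce a minimum delay between successive requests to one host."""

    def __init__(self, min_delay=1.0, clock=time.monotonic, sleep=time.sleep):
        self.min_delay = min_delay  # seconds between hits on the same host (assumed value)
        self._clock = clock         # injectable so the logic can be tested without waiting
        self._sleep = sleep
        self._last = {}             # host -> time of the most recent request

    def wait_for(self, url):
        """Block until the host of `url` may be contacted; return the seconds waited."""
        host = urlparse(url).netloc
        waited = 0.0
        last = self._last.get(host)
        if last is not None:
            remaining = self.min_delay - (self._clock() - last)
            if remaining > 0:
                self._sleep(remaining)
                waited = remaining
        self._last[host] = self._clock()
        return waited
```

A crawler loop would call `wait_for(url)` immediately before each fetch, so requests to different hosts proceed without delay while repeated requests to one host are spaced out.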
Analysis of Real-World XML Queries
Hlísta, Peter ; Holubová, Irena (advisor) ; Klímek, Jakub (referee)
The aim of this master thesis was to gather and analyze real-world XQuery programs. The data gathering is performed using a crawler. The thesis contains an analysis of different crawlers, from which the most suitable one was chosen. The crawler was modified so that it did not overload servers, gathered the right data and was able to pause. Before the data gathering, we analyzed where to start gathering and how long it should take. Once the data were gathered, they needed to be cleaned and validated. The subjects of the analyses were the use of the XQuery language and the occurrences of XQuery grammar symbols. A combination of the XML representation of XQuery programs and XPath expressions for querying this representation was used to perform these analyses. XQConveror was used to create this XML representation. The main contributions of this thesis are the gathered data and the first analysis of real-world XQuery programs.
